fix: preserve string literals during schema qualification stripping (#371) by tianzhou · Pull Request #372 · pgplex/pgschema

tianzhou · 2026-03-24T16:46:08Z

Summary

stripSchemaQualificationsFromText used regex patterns that matched schema prefixes inside single-quoted SQL string literals, corrupting e.g. 's.manage' into 'manage' when the target schema was s
This caused pgschema plan to generate destructive false-positive ALTER POLICY statements that silently truncated function arguments in USING/WITH CHECK expressions
Added stripSchemaQualificationsPreservingStrings which splits text on single-quoted string boundaries before applying schema stripping, preserving string literal content
Added fast-path strings.Contains check to skip all work when the schema name is absent

Fixes #371

Test plan

Unit tests: go test -v ./internal/postgres -run TestStripSchemaQualifications_PreservesStringLiterals (6 cases covering string preservation, escaped quotes, mixed identifiers/literals)
Diff tests: PGSCHEMA_TEST_FILTER="create_policy/" go test -v ./internal/diff -run TestDiffFromFiles (added scope_check policy with has_scope('public.manage') to existing alter_policy_using test case)
Integration tests: PGSCHEMA_TEST_FILTER="create_policy/" go test -v ./cmd -run TestPlanAndApply (all 10 policy tests pass, no false-positive diff for string literal policy)

🤖 Generated with Claude Code

Copilot

Pull request overview

This PR fixes schema-qualification stripping during pgschema plan so that schema prefixes inside single-quoted SQL string literals (e.g. 's.manage') are preserved, preventing destructive false-positive diffs for RLS policies.

Changes:

Adds string-literal–aware schema qualification stripping by splitting non–dollar-quoted SQL on single-quoted boundaries before applying regex-based stripping.
Adds a fast-path to skip processing when the schema name does not appear in the SQL text.
Adds unit tests and extends diff fixtures to cover the RLS policy string-literal regression (Issue #371).

Reviewed changes

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
internal/postgres/desired_state.go	Updates schema qualification stripping to preserve single-quoted string literals and adds a `strings.Contains` fast-path.
internal/postgres/desired_state_test.go	Adds unit tests validating schema stripping while preserving string literals and escaped quotes.
testdata/diff/create_policy/alter_policy_using/old.sql	Extends policy diff fixture with a policy using a dotted string literal (`'public.manage'`).
testdata/diff/create_policy/alter_policy_using/new.sql	Extends policy diff fixture with the same string-literal policy to ensure no false-positive diff.
testdata/diff/create_policy/alter_policy_using/plan.json	Updates the expected plan fingerprint hash after fixture changes.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-24T16:50:32Z

internal/postgres/desired_state_test.go

+			sql:      "SELECT 'public.a', public.t, 'public.b';",
+			schema:   "public",
+			expected: "SELECT 'public.a', t, 'public.b';",
+		},


The new unit tests cover basic string-literal preservation, but they don’t cover the failure mode where apostrophes appear inside SQL comments (e.g., -- don't ...) and interfere with the string-splitting state machine. Adding a test case with a comment containing a single quote followed by a schema-qualified identifier would help prevent regressions once comment-handling is fixed.

Suggested change

},

},

{

name: "handles apostrophe in comment followed by schema-qualified identifier",

sql: "SELECT 1; -- don't drop public.t\nDROP TABLE public.t;",

schema: "public",

expected: "SELECT 1; -- don't drop public.t\nDROP TABLE t;",

},

Copilot · 2026-03-24T16:50:32Z

internal/postgres/desired_state.go

+	inString := false
+
+	i := 0
+	segStart := 0
+	for i < len(text) {
+		ch := text[i]
+		if ch == '\'' {
+			if !inString {
+				// End of non-string segment — strip schema qualifications from it
+				result.WriteString(stripSchemaQualificationsFromText(text[segStart:i], schemaName))
+				segStart = i
+				inString = true
+				i++
+			} else {
+				// Check for escaped quote ('')
+				if i+1 < len(text) && text[i+1] == '\'' {
+					i += 2 // skip ''
+				} else {
+					// End of string literal — write it as-is
+					inString = false
+					i++
+					result.WriteString(text[segStart:i])
+					segStart = i
+				}
+			}
+		} else {
+			i++
+		}
+	}


stripSchemaQualificationsPreservingStrings treats every single quote as the start/end of a SQL string literal, but it doesn’t account for quotes appearing inside SQL comments (e.g., -- don't ...). A single apostrophe in a -- or /* ... */ comment can flip inString and cause the rest of the SQL text to be treated as “string”, preventing schema qualification stripping for subsequent statements (potentially creating objects in the real schema instead of the temp schema). Consider extending the splitter to recognize and skip over line/block comments (and only toggle inString when not inside a comment).

Suggested change

inString := false

i := 0

segStart := 0

for i < len(text) {

ch := text[i]

if ch == '\'' {

if !inString {

// End of non-string segment — strip schema qualifications from it

result.WriteString(stripSchemaQualificationsFromText(text[segStart:i], schemaName))

segStart = i

inString = true

i++

} else {

// Check for escaped quote ('')

if i+1 < len(text) && text[i+1] == '\'' {

i += 2 // skip ''

} else {

// End of string literal — write it as-is

inString = false

i++

result.WriteString(text[segStart:i])

segStart = i

}

}

} else {

i++

}

}

inString := false

inLineComment := false

inBlockComment := false

i := 0

segStart := 0

for i < len(text) {

ch := text[i]

if inString {

if ch == '\'' {

// Check for escaped quote ('')

if i+1 < len(text) && text[i+1] == '\'' {

i += 2 // skip ''

continue

}

// End of string literal — write it as-is

inString = false

i++

result.WriteString(text[segStart:i])

segStart = i

continue

}

// Inside string literal, just advance

i++

continue

}

if inLineComment {

// Line comment ends at newline

if ch == '\n' {

inLineComment = false

}

i++

continue

}

if inBlockComment {

// Block comment ends at */

if ch == '*' && i+1 < len(text) && text[i+1] == '/' {

inBlockComment = false

i += 2

} else {

i++

}

continue

}

// Not in string or comment

if ch == '-' && i+1 < len(text) && text[i+1] == '-' {

// Start of line comment

inLineComment = true

i += 2

continue

}

if ch == '/' && i+1 < len(text) && text[i+1] == '*' {

// Start of block comment

inBlockComment = true

i += 2

continue

}

if ch == '\'' {

// Start of string literal. End of non-string segment — strip schema qualifications from it.

result.WriteString(stripSchemaQualificationsFromText(text[segStart:i], schemaName))

segStart = i

inString = true

i++

continue

}

i++

}

Copilot · 2026-03-24T16:50:32Z

internal/postgres/desired_state.go

+			if !inString {
+				// End of non-string segment — strip schema qualifications from it
+				result.WriteString(stripSchemaQualificationsFromText(text[segStart:i], schemaName))
+				segStart = i


stripSchemaQualificationsFromText compiles multiple regexes on every call. With the new string-literal splitting, it can now be invoked many times per SQL statement (one per non-string segment), which can significantly increase CPU/time on large schemas. Consider compiling the regexes once per schemaName per stripSchemaQualifications invocation (or caching by schemaName) and reusing them across segments.

greptile-apps · 2026-03-24T16:51:39Z

Greptile Summary

This PR fixes a correctness bug in stripSchemaQualifications where regex-based schema prefix removal was applied to the contents of single-quoted SQL string literals, corrupting expressions like has_scope('public.manage') into has_scope('manage') and causing false-positive ALTER POLICY statements in plan output.

Key changes:

Adds stripSchemaQualificationsPreservingStrings — a character-level scanner that splits text on single-quoted string boundaries, applying stripSchemaQualificationsFromText only to non-string segments and writing string literals through untouched.
Adds a strings.Contains fast-path in stripSchemaQualifications to skip all processing when the schema name is absent from the input.
Adds 6 unit tests covering string preservation, ''-escaped embedded quotes, mixed identifier/literal inputs, and multiple literals in a single statement.
Updates the alter_policy_using diff test case with a scope_check policy that exercises the fix end-to-end; the policy correctly produces no diff.

Two minor edge cases to be aware of:

A single quote appearing inside a -- line comment will cause the scanner to enter inString = true, suppressing schema stripping for all subsequent text until a matching closing quote is found. This is unlikely in practice but worth documenting.
The E'...' escape-string form (where \' embeds a quote) is not recognised. The failure mode is conservative — identifiers after the real end of such a string may not be stripped — but it's not a correctness regression over the prior code.

Confidence Score: 5/5

Safe to merge — fixes a real false-positive corruption bug with correct logic, comprehensive tests, and no regressions.
The core fix (splitting on single-quoted string boundaries before applying regex stripping) is logically sound and thoroughly tested for all common SQL string forms. The two noted edge cases (-- comment quotes and E'...' backslash escapes) are pre-existing limitations of the parsing pipeline that fail conservatively (missed stripping, never corruption). The diff test fixture directly validates the end-to-end scenario from the bug report. No existing tests are broken.
internal/postgres/desired_state.go — review the two P2 edge cases in stripSchemaQualificationsPreservingStrings (comment-embedded quotes and E'...' escape strings) before expanding the feature further.

Important Files Changed

Filename	Overview
internal/postgres/desired_state.go	Introduces `stripSchemaQualificationsPreservingStrings` which splits on single-quoted string boundaries before applying regex-based schema stripping. Logic is correct for standard `''`-escaped strings; two minor edge cases exist (single quotes inside `--` comments, and `E'...'` backslash-escape form) that are pre-existing limitations and fail conservatively (missed stripping, never corruption). Also adds a `strings.Contains` fast-path optimization.
internal/postgres/desired_state_test.go	Adds 6 targeted unit tests for `stripSchemaQualifications` covering: basic stripping, single-quoted string preservation, short schema names, mixed identifier/literal, `''`-escaped quotes, and multiple literals. Coverage is thorough for the stated fix.
testdata/diff/create_policy/alter_policy_using/new.sql	Adds `has_scope` helper function and `scope_check` policy with `has_scope('public.manage')` to serve as the regression fixture for Issue #371. Correctly absent from `plan.json` because old.sql and new.sql are identical for this policy.
testdata/diff/create_policy/alter_policy_using/old.sql	Mirrors new.sql by adding the same `has_scope` function and `scope_check` policy, ensuring no false-positive diff is generated. Only the `user_tenant_isolation` USING expression differs between the two files.
testdata/diff/create_policy/alter_policy_using/plan.json	Hash updated for the new source SQL; plan still only contains the expected `ALTER POLICY user_tenant_isolation` step, confirming `scope_check` produces no false-positive diff.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A["stripSchemaQualifications(sql, schemaName)"] --> B{schemaName == empty\nor not in sql?}
    B -- Yes --> C[Return sql unchanged\nfast-path]
    B -- No --> D["splitDollarQuotedSegments(sql)"]
    D --> E{Segment\ndollar-quoted?}
    E -- Yes --> F[Write segment as-is\npreserve function bodies]
    E -- No --> G["stripSchemaQualificationsPreservingStrings(seg, schemaName)"]
    G --> H{Scan character by character}
    H --> I{char == single-quote?}
    I -- Not in string --> J["Flush non-string segment\nto stripSchemaQualificationsFromText()\nEnter string mode"]
    I -- In string, next is also quote --> K["Skip '' pair\nstay in string mode"]
    I -- In string, next is not quote --> L["Write string literal as-is\nExit string mode"]
    I -- Not a quote --> M[Advance i]
    J --> H
    K --> H
    L --> H
    M --> H
    H -- End of text --> N{Still inString?}
    N -- Yes --> O[Write remaining as-is\nunterminated literal]
    N -- No --> P["Apply stripSchemaQualificationsFromText()\nto trailing segment"]
    F --> Q[Reassemble result]
    O --> Q
    P --> Q

Comments Outside Diff (1)

internal/postgres/desired_state.go, line 120-164 (link)

Single-quote inside -- comment mis-parses subsequent identifiers

If a SQL -- line comment contains a single quote (e.g. -- it's ready), the parser enters inString = true and treats every subsequent character on all following lines as inside a string literal. This means schema qualifications on those lines will silently not be stripped.

Example:
```
-- it's a note
SELECT public.users;
```
With the current code, public.users will not be stripped because the ' in the comment triggers the "start of string" branch.

This is a pre-existing limitation of the un-comment-aware parsing pipeline, but the introduction of stripSchemaQualificationsPreservingStrings makes this concern real for multi-statement DDL files containing annotated SQL (which the test data in this very PR does — see the -- Policy with string literal… comments). Since those comments do not contain quotes there is no regression in the test cases, but the boundary case is unguarded.

Consider either:
- Skipping characters after -- until \n before entering the quote-scanning loop, or
- Documenting this limitation in the function's doc comment.

_{Reviews (1): Last reviewed commit: "fix: preserve string literals during sch..." | Re-trigger Greptile}

greptile-apps · 2026-03-24T16:51:46Z

internal/postgres/desired_state.go

+			} else {
+				// Check for escaped quote ('')
+				if i+1 < len(text) && text[i+1] == '\'' {
+					i += 2 // skip ''
+				} else {


E'...' backslash-escaped quote not handled

The escape-string syntax E'it\'s public.test' uses \' (backslash + single quote) to embed a quote rather than '' (doubling). The current parser only recognises the '' form:

if i+1 < len(text) && text[i+1] == '\'' { i += 2 // skip '' }

With E'content\'' suffix, the \ is consumed as an ordinary character; then when ' (the real escaped quote) is reached and the following char is ' (the string closer), the code treats both as a '' pair and skips them — leaving inString = true beyond the true end of the string. Any schema identifiers after the closing quote will therefore be treated as string content and not stripped.

The failure mode is a false-negative (missed stripping) rather than a false-positive (corruption), so it is safe but subtly incorrect. A doc comment or a test case capturing this limitation would make the boundary explicit.

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 1 comment.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-26T07:16:34Z

internal/postgres/desired_state.go

+func stripSchemaQualificationsPreservingStrings(text string, schemaName string) string {
+	var result strings.Builder
+	result.Grow(len(text))
+	inString := false
+
+	i := 0
+	segStart := 0
+	for i < len(text) {
+		ch := text[i]
+		if ch == '\'' {
+			if !inString {
+				// End of non-string segment — strip schema qualifications from it
+				result.WriteString(stripSchemaQualificationsFromText(text[segStart:i], schemaName))
+				segStart = i
+				inString = true
+				i++
+			} else {
+				// Check for escaped quote ('')
+				if i+1 < len(text) && text[i+1] == '\'' {
+					i += 2 // skip ''
+				} else {
+					// End of string literal — write it as-is
+					inString = false
+					i++


stripSchemaQualificationsPreservingStrings toggles inString on every single quote without accounting for SQL comments (-- ... / /* ... */). A single apostrophe inside a comment (e.g., -- don't strip) can cause the rest of the file to be treated as an unterminated string, skipping schema stripping for subsequent statements and potentially executing schema-qualified DDL against the real schema instead of the temp schema. Consider extending the scanner to recognize and skip comment regions (or strip comments before scanning), and add a unit test that includes an apostrophe in a comment before a public.-qualified reference.

…371) stripSchemaQualificationsFromText used regex patterns that matched schema prefixes inside single-quoted SQL string literals. For example, with schema "s", Pattern 4 treated the single quote in 's.manage' as a valid non-double-quote prefix character, corrupting 's.manage' into 'manage'. This caused pgschema plan to generate destructive false-positive ALTER POLICY statements that silently truncated function arguments. Add stripSchemaQualificationsPreservingStrings which splits text on single-quoted string boundaries (handling '' escapes) and applies schema stripping only to non-string segments. Also add a fast-path strings.Contains check to skip all work when the schema name is absent. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

…parser Add doc comment and test case capturing the known limitation where E'...' escape-string syntax causes the single-quote parser to mistrack boundaries. The failure mode is safe (false-negative: unstripped qualifier). Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Apostrophes in SQL comments (-- don't ...) could flip the string-tracking state, causing schema qualifiers after the comment to not be stripped. Also cache compiled regexes per schema name for better performance. Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 3 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-26T08:35:40Z

internal/postgres/desired_state.go

+		re1: regexp.MustCompile(fmt.Sprintf(`"%s"\.(\"[^"]+\")`, escapedSchema)),
+		re2: regexp.MustCompile(fmt.Sprintf(`"%s"\.([a-zA-Z_][a-zA-Z0-9_$]*)`, escapedSchema)),
+		re3: regexp.MustCompile(fmt.Sprintf(`(?:^|[^"])%s\.(\"[^"]+\")`, escapedSchema)),
+		re4: regexp.MustCompile(fmt.Sprintf(`(?:^|[^"])%s\.([a-zA-Z_][a-zA-Z0-9_$]*)`, escapedSchema)),
+	}


The unquoted-schema regexes (re3/re4) can match the target schema name as a suffix of a longer identifier because the only prefix check is (?:^|[^"]). For short schema names (e.g., schemaName=s), a reference like sales.table can be partially matched and rewritten incorrectly (e.g., corrupting sales.table to saletable). Consider requiring a non-identifier boundary before the schema name (start or a char not in [a-zA-Z0-9_$"]) and preserving the delimiter via a capture group rather than consuming an arbitrary preceding character.

Copilot · 2026-03-26T08:35:40Z

internal/postgres/desired_state.go

+	result = sr.re3.ReplaceAllStringFunc(result, func(match string) string {
 		// If match starts with a non-quote character, preserve it
 		if len(match) > 0 && match[0] != '"' {
-			// Extract the quote identifier (everything after schema.)
 			parts := strings.SplitN(match, ".", 2)
 			if len(parts) == 2 {
 				return string(match[0]) + parts[1]
 			}
 		}


stripSchemaQualificationsFromText’s re3/re4 replacement assumes the first byte of match is the delimiter to preserve. When the regex matches at the beginning of the string (because of the ^ alternative), match[0] is actually the first letter of the schema name, producing wrong output (e.g., public.t -> pt). The regex/replacement should distinguish “start-of-string” vs “preceded by delimiter” (typically by capturing the prefix as a group and reusing it in the replacement).

Copilot · 2026-03-26T08:35:41Z

internal/postgres/desired_state.go

+// Limitation: E'...' escape-string syntax uses backslash-escaped quotes (E'it\'s')
+// rather than doubled quotes ('it''s'). This parser only recognises the '' form.
+// With E'content\'', a backslash-escaped quote may cause the parser to mistrack
+// string boundaries, resulting in schema qualifiers after the string not being
+// stripped (false-negative). This is safe — it preserves the original SQL — and
+// E'...' strings are extremely rare in DDL schema definitions.


The comment claims the E'...' limitation is “safe”, but the current parser can still strip schema qualifiers inside an E-string literal, changing object definitions and potentially reintroducing destructive false-positive diffs (the same failure mode as #371, just for E-strings). Either handle E-strings (treat backslash-escaped quotes as non-terminating) or clarify in the docs/tests that this case can change semantics and is not guaranteed safe.

…lacements - Use [^a-zA-Z0-9_$"] boundary instead of [^"] to prevent matching schema name as suffix of longer identifiers (e.g., schema "s" no longer matches "sales.total") - Use capture groups with ReplaceAllString instead of ReplaceAllStringFunc to correctly handle start-of-string matches - Clarify E-string limitation doc: both false-negative and false-positive are possible Co-Authored-By: Claude Opus 4.6 (1M context) <noreply@anthropic.com>

Copilot

Pull request overview

Copilot reviewed 5 out of 5 changed files in this pull request and generated 2 comments.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

Copilot · 2026-03-26T08:56:07Z

internal/postgres/desired_state.go

+
+	for i < len(text) {
+		ch := text[i]
+
+		// Start of single-quoted string literal
+		if ch == '\'' {
+			flushCode(i)
+			i++ // skip opening quote
+			for i < len(text) {
+				if text[i] == '\'' {
+					if i+1 < len(text) && text[i+1] == '\'' {
+						i += 2 // skip escaped ''


stripSchemaQualificationsPreservingStrings does not handle Postgres escape-string literals (E'...') with backslash-escaped quotes. With input like E'it\'s public.test' FROM public.t;, the scanner will treat the \' as a string terminator, causing the later ' to be treated as a new (unterminated) string. This can produce invalid SQL and/or skip stripping after the string. Consider detecting E-strings and treating a quote as escaped when preceded by an odd number of backslashes (or avoiding splitting on ' when preceded by E and then using an E-string-aware scan).

Suggested change

for i < len(text) {

ch := text[i]

// Start of single-quoted string literal

if ch == '\'' {

flushCode(i)

i++ // skip opening quote

for i < len(text) {

if text[i] == '\'' {

if i+1 < len(text) && text[i+1] == '\'' {

i += 2 // skip escaped ''

isIdentChar := func(b byte) bool {

return (b >= 'a' && b <= 'z') ||

(b >= 'A' && b <= 'Z') ||

(b >= '0' && b <= '9') ||

b == '_'

}

for i < len(text) {

ch := text[i]

// Start of single-quoted string literal (standard or Postgres E'...' escape string)

if ch == '\'' {

// Detect Postgres escape string literal prefix E'...' or e'...'

eString := false

literalStart := i

if i > 0 && (text[i-1] == 'E' || text[i-1] == 'e') {

// Ensure the preceding 'E' is not part of a larger identifier

if i-1 == 0 || !isIdentChar(text[i-2]) {

eString = true

literalStart = i - 1

}

}

flushCode(literalStart)

i++ // skip opening quote

for i < len(text) {

if text[i] == '\'' {

if eString {

// In an E-string, a quote is escaped if preceded by an odd number of backslashes.

backslashes := 0

j := i - 1

for j >= 0 && text[j] == '\\' {

backslashes++

j--

}

if backslashes%2 == 1 {

// Escaped quote, keep scanning literal.

i++

continue

}

}

if i+1 < len(text) && text[i+1] == '\'' {

i += 2 // skip escaped '' (standard SQL escaping)

Copilot · 2026-03-26T08:56:07Z

internal/postgres/desired_state_test.go

+			// Known limitation: E'...' escape-string syntax with backslash-escaped quotes
+			// is not handled. The parser treats \' as ordinary char + string-closer,
+			// mistracking boundaries. Here it strips inside the string (wrong) and
+			// misses the identifier after (also wrong). Both are safe: the SQL remains
+			// valid, and the unstripped qualifier just means the object is looked up
+			// in the original schema. E'...' in DDL is extremely rare.
+			name:     "E-string with backslash-escaped quote (known limitation)",
+			sql:      "SELECT E'it\\'s public.test' FROM public.t;",
+			schema:   "public",
+			expected: "SELECT E'it\\'s test' FROM public.t;",


The “E-string with backslash-escaped quote” case assumes the output remains valid SQL (and expects a specific transformation), but the current splitter will treat \' as a string terminator and can end up producing an unterminated string literal / skipping subsequent stripping. Once E-string handling is fixed in stripSchemaQualificationsPreservingStrings, update this expected value to reflect the corrected behavior (and consider asserting validity/round-trip expectations for E-strings explicitly).

Suggested change

// Known limitation: E'...' escape-string syntax with backslash-escaped quotes

// is not handled. The parser treats \' as ordinary char + string-closer,

// mistracking boundaries. Here it strips inside the string (wrong) and

// misses the identifier after (also wrong). Both are safe: the SQL remains

// valid, and the unstripped qualifier just means the object is looked up

// in the original schema. E'...' in DDL is extremely rare.

name: "E-string with backslash-escaped quote (known limitation)",

sql: "SELECT E'it\\'s public.test' FROM public.t;",

schema: "public",

expected: "SELECT E'it\\'s test' FROM public.t;",

// E'...' escape-string syntax with backslash-escaped quotes should be

// handled correctly: the string literal is preserved exactly, and schema

// qualifications are stripped only from identifiers outside of strings.

name: "E-string with backslash-escaped quote",

sql: "SELECT E'it\\'s public.test' FROM public.t;",

schema: "public",

expected: "SELECT E'it\\'s public.test' FROM t;",

Copilot AI review requested due to automatic review settings March 24, 2026 16:46

Copilot started reviewing on behalf of tianzhou March 24, 2026 16:46 View session

Copilot AI reviewed Mar 24, 2026

View reviewed changes

greptile-apps bot reviewed Mar 24, 2026

View reviewed changes

tianzhou requested a review from Copilot March 26, 2026 07:11

tianzhou force-pushed the fix/issue-371-preserve-string-literals-in-schema-stripping branch from 3e36065 to 750359e Compare March 26, 2026 07:12

Copilot started reviewing on behalf of tianzhou March 26, 2026 07:12 View session

Copilot AI reviewed Mar 26, 2026

View reviewed changes

tianzhou and others added 2 commits March 26, 2026 01:05

tianzhou force-pushed the fix/issue-371-preserve-string-literals-in-schema-stripping branch from 750359e to b1dcba8 Compare March 26, 2026 08:07

tianzhou requested a review from Copilot March 26, 2026 08:30

Copilot started reviewing on behalf of tianzhou March 26, 2026 08:31 View session

Copilot AI reviewed Mar 26, 2026

View reviewed changes

tianzhou requested a review from Copilot March 26, 2026 08:51

Copilot started reviewing on behalf of tianzhou March 26, 2026 08:51 View session

Copilot AI reviewed Mar 26, 2026

View reviewed changes

tianzhou merged commit 955a2ea into main Mar 26, 2026
5 checks passed

-		},
+		},
+		{
+			name: "handles apostrophe in comment followed by schema-qualified identifier",
+			sql:  "SELECT 1; -- don't drop public.t\nDROP TABLE public.t;",
+			schema:   "public",
+			expected: "SELECT 1; -- don't drop public.t\nDROP TABLE t;",
+		},

-	for i < len(text) {
-		ch := text[i]
-		// Start of single-quoted string literal
-		if ch == '\'' {
-			flushCode(i)
-			i++ // skip opening quote
-			for i < len(text) {
-				if text[i] == '\'' {
-					if i+1 < len(text) && text[i+1] == '\'' {
-						i += 2 // skip escaped ''
+	isIdentChar := func(b byte) bool {
+		return (b >= 'a' && b <= 'z') ||
+			(b >= 'A' && b <= 'Z') ||
+			(b >= '0' && b <= '9') ||
+			b == '_'
+	}
+	for i < len(text) {
+		ch := text[i]
+		// Start of single-quoted string literal (standard or Postgres E'...' escape string)
+		if ch == '\'' {
+			// Detect Postgres escape string literal prefix E'...' or e'...'
+			eString := false
+			literalStart := i
+			if i > 0 && (text[i-1] == 'E' || text[i-1] == 'e') {
+				// Ensure the preceding 'E' is not part of a larger identifier
+				if i-1 == 0 || !isIdentChar(text[i-2]) {
+					eString = true
+					literalStart = i - 1
+				}
+			}
+			flushCode(literalStart)
+			i++ // skip opening quote
+			for i < len(text) {
+				if text[i] == '\'' {
+					if eString {
+						// In an E-string, a quote is escaped if preceded by an odd number of backslashes.
+						backslashes := 0
+						j := i - 1
+						for j >= 0 && text[j] == '\\' {
+							backslashes++
+							j--
+						}
+						if backslashes%2 == 1 {
+							// Escaped quote, keep scanning literal.
+							i++
+							continue
+						}
+					}
+					if i+1 < len(text) && text[i+1] == '\'' {
+						i += 2 // skip escaped '' (standard SQL escaping)

Conversation

tianzhou commented Mar 24, 2026

Summary

Test plan

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

greptile-apps bot commented Mar 24, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Greptile Summary

Confidence Score: 5/5

Important Files Changed

Flowchart

Comments Outside Diff (1)

Uh oh!

greptile-apps bot Mar 24, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Mar 26, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

greptile-apps bot commented Mar 24, 2026 •

edited

Loading